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RAW SEQUENCE LISTING 
ERROR REPORT 





The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 
Source: 

Date Processed by STIC: 



6 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 
PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTTN 2.1 e-mail help: Datin21help@usDto.gov or phone 703-306-4119 (R. Wax) 
PATENTTN 3.0 e-mail help: patin3help(atuspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM. ACCESSIBLE THROUGH THE U. S. PATENT AND w 
TRADEMARK OFFICE WEBSITE. SEE BELOW: m 

H 

Checker Version 3.0 g 

The Checker Version 3.0 application is a state-of the-art Windows based software program — 

employing a logical and intuitive user-interface to check whether a sequence listing is in 

compliance with format and content rules. Checker Version 3.0 works for sequence listings W 

generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old (^j 

rules) and the revised version (new rules) effective July 1 , 1 998 as well as World Intellectual 

Property Organization (WIPO) Standard ST.25. q 

Checker Version 3 .0 replaces the previous DOS-based version of Checker, and is Y2K- -g 

compliant. Checker allows public users to check sequence listings in Computer Readable form -< 

(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 

Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 

listings, thus saving time and money. 



Checker Version 3.0 can be down loaded from the US PTO website at the following address: 

http://www.uspto.gov/web/offices/pac/checker 



PAGE: 1 RAW SEQUENCE LISTING DATE: 07/06/2001 

PATENT APPLICATION US/09/856,796 TIME: 04:30:40 



INPUT SET: S36582.raw 



This Raw Listing contains the General 
Information Section and those Sequences 
containing ERRORS. 



-> 
-> 
-> 
-> 
-> 
-> 



-> 



i 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 



fiAL T /UfOlLTMmM SEQUENCE LISTING 
general Information^ 



fa 

Does Not Comply 
Corrected Diskette Needed 



general iniormatiorjjE » m A ^ 

(ft) NAME " ! CENTRE NATIONAL DE LA RECHERCHE SCIENTIPIQUE J 



(ii) 



( D ) OTREET i — — r u e Mi c h e l - A nge 
CITY 8 PARIO — 

CUUNTKV! FRANCE 

(P) POSTAL CODE ; 757 9 4 CEDEX 10 

TITLE OF JPHfi INVENTION: NF-?B ACTIVATION INHIBITORS, 
THEIR PHARMACEUTICAL USES 



AND 



(iii) NUMBER OF SEQUENCES: 4 



fU) <&v) COMPUTER READABLE FORJ£>W r J ^ 
(A) TYPE OF OUrrORT fFloppy disk 



Ovi CORRESPOND^JCE ADDRESS: 
(A) ADDRESSEE: 
CB) STREET: 
(O OTY: 
(D) STATE: 
(D COUNTR Y: 
<F)Z1P: 



(B) COMPUTER: I BM PC compatible ^ Qff /? ^jJ^Qr £ UCyjZ^ 

(C) UCEH CYOTDMl ^PC-DOS/MS-DOS J 

(D) SOFTWARE: Patentln Release #1.0, Version #1 . 30(T6EB) 



(vO CURRENT APPUCATION DAJA : 

(A) APPUCATION NUMBER- 

(B) FILING DATE: 




ERRORED SEQUENCES FOLLOW: 



— > 



-> 
-> 



--> 



-> 
-> 
-> 



24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
38 
39 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 609 base pairs ^ ^ A A /Ccl* 

(B) TYPE: nucleotide ^ STfcW^*'*»' 
fni rffflMBER OF STRANPsT double 

(D) CONFIGURATION: linear 



v 



(ii) TYPE OF MOEECULB : DNA( genomic) 

y>\i9u£6u-<x rye/ 

(ix) CIIftnftOTFRICTICM 

(A) NAME/KEY: CDS 

(B) POSITION: 1 . . 609 




far* 



A* 

a- 




PAGE: 2 RAW SEQUENCE LISTING DATE: 07/06/2001 

PATENT APPLICATION US/09/856,796 TIME: 04:30:40 

INPUT SET: S36582.raw 

— > 40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

41 



A O 

4 I 

A O 

4 J 
44 


ATG 

A O 


GCT 


AC A 


c*\ /"i /-I 

GGC 


1 LL 


CGG 


AC Li 


1 LL 


C 1 G 


L1L 


Lib 


ririm 
uL 1 


TTT 




/imp 
Liu 


nrnp 

Lr 1 L. 


Mex 


Ala 


Thr 


Giy 


Ser 


Arg 


inr 


O «-w -r- 

ber 


Leu 


Leu 


Lgu 


A "1 o 

A±a 


Fne 


Giy 


Leu 


Leu 


45 


1 








b 










1 u 










1 o 




46 


































47 


TGC 


CTG 


CCC 


TGG 


CTT 


CAA 


GAG 


GGC 


AG I 


/~i /~% r~\ 

GCC 


1 iC 


CCA 


a r^o 
ACC 


a mm 
Al 1 


CCC 


mm A 

1 1A 


48 


96 
































49 


Cys 


Leu 


Pro 


Trp 


Leu 


Gin 


Glu 


Gly 


Ser 


A "1 *a 

Ala 


Fne 


Pro 


Thr 


lie 


Pro 


Leu 


50 








20 










I 5 










JU 






51 


































52 


TCC 


AGG 


CTT 


TTT 


GAC 


AAC 


GCT 


AGT 


CTC 


CGC 


GCC 


CAT 


CGT 


i-y m /"I 

CTG 


nun 

CAC 


CAG 


53 


144 
































54 


Ser 


Arg 


Leu 


Phe 


Asp 


Asn 


Ala 


Ser 


Leu 


Arg 


Ala 


His 


Arg 


Leu 


His 


Gin 


55 






35 










40 










4 5 








56 


































57 


































58 


CTG 


GCC 


TTT 


GAC 


ACC 


TAC 


CAG 


GAG 


TTT 


AAC 


CCC 


CAG 


ACC 


TCC 


CTC 


TGT 


59 


192 
































60 


Leu 


Ala 


Phe 


Asp 


Thr 


Tyr 


Gin 


Glu 


Pne 


Asn 


Pro 


Gin 


Thr 


Ser 


Leu 


Cys 


61 




50 










55 










60 










62 


































63 


TTC 


TCA 


GAG 


TCT 


ATT 


CCG 


AC A 


CCC 


TCC 


AAC 


AGG 


GAG 


GAA 


AC A 


CAA 


CAG 


64 


240 
































65 


Phe 


Ser 


Glu 


Ser 


He 


Pro 


Thr 


Pro 


Ser 


Asn 


Arg 


Glu 


Glu 


Thr 


Gin 


Gin 


66 


65 










70 










75 










80 


67 


































68 


AAA 


TCC 


AAC 


CTA 


GAG 


CTG 


CTC 


CGC 


ATC 


TCC 


CTG 


CTG 


CTC 


ATC 


CAG 


TCG 


6 9 


2 88 
































70 


Lys 


Ser 


Asn 


Leu 


Glu 


Leu 


Leu 


Arg 


lie 


Ser 


Leu 


Leu 


Leu 


lie 


Gin 


Ser 


71 










85 










90 










95 




72 


































73 


TGG 


CTG 


GAG 


CCC 


GTG 


CAG 


TTC 


CTC 


AGG 


AGT 


GTC 


TTC 


GCC 


AAC 


AGC 


CTG 


74 


"5 "5 H 

3 36 
































75 


Trp 


Leu 


Glu 


Pro 


vai 


Gin 


File 


Leu 


Arg 


Ser 


vai 


pne 


Ala 

Aia 


Asn 


Ser 


Leu 


76 








100 










105 










110 






77 


































78 


GTG 


TAC 


GGC 


GCC 


TCT 


GAC 


AGC 


AAC 


GTC 


TAT 


GAC 


CTC 


CTA 


a a r* 
AAC 


GAC 


CIA 


7 9 


3 84 
































80 


Val 


Tyr 


Gly 


Ala 


Ser 


Asp 


Ser 


Asn 


Val 


Tyr 


Asp 


Leu 


Leu 


Lys 


Asp 


Leu 


81 






115 










120 










125 








82 


































83 


GAG 


GAA 


GGC 


ATC 


CAA 


ACG 


CTG 


ATG 


GGG 


AGG 


CTG 


GAA 


GAT 


GGC 


AGC 


CCC 


84 


432 
































85 


Glu 


Glu 


Gly 


He 


Gin 


Thr 


Leu 


Met 


Gly 


Arg 


Leu 


Glu 


Asp 


Gly 


Ser 


Pro 


86 




130 










135 










140 










87 


































88 


CGG 


ACT 


GGG 


CAG 


ATC 


TTC 


AAG 


CAG 


ACC 


TAC 


AGC 


AAG 


TTC 


GAC 


AC A 


AAC 


89 


480 
































90 


Arg 


Thr 


Gly 


Gin 


He 


Phe 


Lys 


Gin 


Thr 


Tyr 


Ser 


Lys 


Phe 


Asp 


Thr 


Asn 


91 


145 










150 










155 










160 



9? 



(/1M aJ^~1 

(fK £a^T^ 

JU) 



92 



PAGE: 3 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/856,796 



DATE: 07/06/2001 
TIME: 04:30:41 



INPUT SET: S36582. raw 

93 TCA CAC AAC GAT GAC GCA CTA CTC AAG AAC TAC GGG CTG CTC TAC TGC 

94 528 

95 Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr Gly Leu Leu Tyr Cys 

96 165 170 175 
97 

98 TTC AGG AAG GAC ATG GAC AAG GTC GAG AC A TTC CTG CGC ATC GTG CAG 

99 576 

100 Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe Leu Arg lie Val Gin 

101 180 185 190 
102 

103 TGC CGC TCT GTG GAG GGC AGC TGT GGC TTC TAG 

104 609 

105 Cys Arg Ser Val Glu Gly Ser Cys Gly Phe '* 

106 195 200 
107 

108 



109 (2) INFORMATION FOR SEQ ID NO: 2: 
110 

111 (i) SEQUENCE CHARACTERISTICS: 

112 (A) LENGTH: 202 amino acids 

113 (B) TYPE: amino acid 

— > 114 ( D ) -CONFI OUR ATI ON : linear 

115 T°foLC(rH'< 

116 (ii) MOLECULE TYPE : protein 

— > 117 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

118 

119 Met Ala Thr Gly Ser Arg Thr Ser Leu Leu Leu Ala Phe Gly Leu Leu 

120 15 10 15 
121 

122 Cys Leu Pro Trp Leu Gin Glu Gly Ser Ala Phe Pro Thr lie Pro Leu 

123 20 25 30 
- 124 

125 Ser Arg Leu Phe Asp Asn Ala Ser Leu Arg Ala His Arg Leu His Gin 

126 35 40 45 
127 

128 Leu Ala Phe Asp Thr Tyr Gin Glu Phe Asn Pro Gin Thr Ser Leu Cys 

129 50 55 60 
130 

131 Phe Ser Glu Ser lie Pro Thr Pro Ser Asn Arg Glu Glu Thr Gin Gin 

132 65 70 75 80 
133 

134 Lys Ser Asn Leu Glu Leu Leu Arg lie Ser Leu Leu Leu lie Gin Ser 

135 85 90 95 
136 

137 Trp Leu Glu Pro Val Gin Phe Leu Arg Ser Val Phe Ala Asn Ser Leu 

138 100 105 110 
139 

140 Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp Leu Leu Lys Asp Leu 

141 115 120 125 
142 

143 Glu Glu Gly lie Gin Thr Leu Met Gly Arg Leu Glu Asp Gly Ser Pro 

144 130 135 140 



PAGE: 4 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/856,796 



DATE: 07/06/2001 
TIME: 04:30:41 



INPUT SET: S36582.raw 

145 

146 Arg Thr Gly Gin lie Phe Lys Gin Thr Tyr Ser Lys Phe Asp Thr Asn 

147 145 150 155 160 
148 

149 Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr Gly Leu Leu Tyr Cys 

150 165 170 175 
151 

152 Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe Leu Arg lie Val Gin 

153 180 185 190 
154 

155 Cys Arg Ser Val Glu Gly Ser Cys Gly Phe 

156 195 200 
157 



— > 



-> 
-> 



-> 
-> 
-> 



— > 



158 
159 
•160 
161 
162 
163 
164 
165 
166 
167 
168 
169 
170 
171 
172 
173 
174 
175 
176 
177 
178 
179 
180 
181 
182 
183 
184 
185 
186 
187 
188 
189 
190 
191 
192 
193 
194 
195 
196 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 581 a m i n o acids 

(B) TYPfU- nucleotide 

urns: 



(c 



( d Configuration ^ linear 

(ii) MOLECULE TYPE: DNA (genomic) 



BER OF STB&NB S; double 



\RACTERISTIQ> Jff I 
\) NAME/KEY: CDSM 




( ix^^O&RACTERISTIQ 
1*7 

(B) POSITION: 1 



582 



ATG 

48 

Met 



CTG 
96 
Leu 
220 

ATC 
144 
He 



GCC 
192 
Ala 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3; 

GGG GTG CAC GAA TGT CCT GCC TGG CTG TGG 

Gly Val His Glu Cys Pro Ala Trp Leu Trp 
205 210 



CTT CTC CTG TCC CTG^v 



Leu . Leu Leu Ser Leu 
215 



TCG CTC CCT CTG GGC CTC CCA GTC CTG GGC GCC CCA CCA CGC CTC 



Ser Leu Pro Leu Gly Leu Pro Val Leu Gly 
225 230 



Ala Pro Pro Arg Leu 
235 



0 pAM^J 



TGT GAC AGC CGA GTC CTG GAG AGG TAC CTC TTG GAG GCC AAG GAG 



Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu 
240 245 



Leu Glu Ala Lys Glu 
250 



GAG AAT ATC ACG ACG GGC TGT GCT GAA CAC TGC AGC TTG AAT GAG 



Glu Asn He Thr Thr Gly Cys Ala Glu His 
255 260 



Cys Ser Leu Asn Glu 
265 



AAT ATC ACT GTC CCA GAC ACC AAA GTT AAT TTC TAT GCC TGG AAG AGG 



PAGE: 5 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/856,796 



DATE: 07/06/2001 
TIME: 04:30:41 



INPUT SET: S36582.raw 



197 


o a a 

240 
































198 


Asn 


lie 


Thr 


val 


Pro 


Asp 


Thr 


Lys 


val 


Hah 

Asn 


Pne 


Tyr 


Ala 


Trp 


Lys 


Arg 


199 






270 










275 










1 O A 

zoU 








200 


































201 


ATG 


GAG 


GTC 


GGG 


CAG 


CAG 


GCC 


GTA 


GAA 


GTC 


TGG 


CAG 


/~i /-I /I 

GGC 


CTG 


GCC 


C IG 


202 


288 
































203 


Mex 


Glu 


Val 


Gly 


Gin 


Gin 


Ala 


Val 


Glu 


val 


Trp 


Gin 


Gly 


Leu 


Ala 


Leu 


204 




285 










o a a 










O Q C 

z y d 










205 


































206 


CTG 


TCG 


GAA 


GCT 


GTC 


CTG 


CGG 


GGC 


CAG 


GCC 


CTG 


TTG 


GTC 


A AC 


TCT 


TCC 


207 


336 
































208 


Leu 


Ser 


Glu 


Ala 


Val 


Leu 


Arg 


Gly 


Gin 


Ala 


Leu 


Leu 


Val 


Asn 


Ser 


Ser 


209 


300 










305 










310 










315 


210 


































211 


CAG 


CCG 


TGG 


GAG 


CCC 


CTG 


CAG 


CTG 


CAT 


GTG 


GAT 


AAA 


GCC 


GTC 


AGT 


GGC 


212 


384 
































213 


Gin 


Pro 


Trp 


Glu 


Pro 


Leu 


Gin 


Leu 


His 


Val 


Asp 


Lys 


Ala 


Val 


Ser 


Gly 


214 










320 










325 










330 




215 


































216 


CTT 


CGC 


AGC 


CTC 


ACC 


ACT 


CTG 


CTT 


CGG 


GCT 


CTG 


GGA 


GCC 


CAG 


AAG 


GAA 


217 


432 
































218 


Leu 


Arg 


Ser 


Leu 


Thr 


Thr 


Leu 


Leu 


Arg 


Ala 


Leu 


Gly 


Ala 


Gin 


Lys 


Glu 


219 








335 










34 0 










345 






220 


































221 


GCC 


ATC 


TCC 


CCT 


CCA 


GAT 


GCG 


GCC 


TCA 


GCT 


GCT 


CCA 


CTC 


CGA 


ACA 


ATC 


222 


480 
































223 


Ala 


lie 


Ser 


Pro 


Pro 


Asp 


Ala 


Ala 


Ser 


Ala 


Ala 


Pro 


Leu 


Arg 


Thr 


lie 


224 






350 










355 










360 








225 


































226 


ACT 


GCT 


GAC 


ACT 


TTC 


CGC 


AAA 


CTC 


TTC 


CGA 


GTC 


TAC 


TCC 


AAT 


TTC 


CTC 


227 


528 
































228 


Thr 


Ala 


Asp 


Thr 


Phe 


Arg 


Lys 


Leu 


Phe 


Arg 


Val 


Tyr 


Ser 


Asn 


Phe 


Leu 


O O Q 

22 9 




"5 C C 

3 65 










OTA 










■JTC 

J / 0 










230 


































231 


CGG 


GGA 


AAG 


CTG 


AAG 


CTG 


TAC 


ACA 


GGG 


GAG 


GCC 


TGC 


AGG 


ACA 


GGG 


GAC 


232 


576 
































233 


Arg 


Gly 


Lys 


Leu 


Lys 


Leu 


Tyr 


Thr 


Gly 


Glu 


Ala 


Cys 


Arg 


Thr 


Gly 


Asp 


234 


380 










385 










390 










395 


235 


































236 


AG A 


TGA 






























237 


582 
































238 


Arg 
































239 


































240 


































241 




































242 (2) INFORMATION FOR SEQ ID NO: 4: 
243 

244 (i) SEQUENCE CHARACTERISTICS: 

245 (A) LENGTH: 193 amino acids 

246 (B) T^PE: amjja o ac 4»d 

-> 247 (DJ^^NFIGu^TIONj^Jinear 

248 



PAGE: 1 



SEQUENCE VERIFICATION REPORT 

PATENT APPLICATION US/09/856,796 



DATE: 07/06/2001 
TIME: 04:30:42 



INPUT SET: S36S82.mw 



Line 


Error 


Original Text 




c 

J 


Unknown or Misplaced Identifier 


(i) DEPOSITOR: 




0 


Unknown or Misplaced Identifier 


(A) NAME: CENTRE NATIONAL DE LA RECHERCHE 


/ 


Unknown or Misplaced Identifier 


(B) STREET: 3, rue Michel-Ange 




Q 
O 


Unknown or Misplaced Identifier 


(C) CITY: PARIS 




Q 

y 


Unknown or Misplaced Identifier 


(E) COUNTRY: FRANCE 




10 


Unknown or Misplaced Identifier 


(F) POSTAL CODE: 75794 CEDEX 16 




I o 


Unknown or Misplaced Identifier 


(A) TYPE OF SUPPORT: Floppy disk 




ZO 


Unknown or Misplaced Identifier 


(C) USER SYSTEM: PC-DOS/MS-DOS 




LI 


cnterea \oVy) ana caic. oeq. Lengxn uiirer 


(A) LENGTH: 609 base pairs 




Z9 


Unknown or Misplaced Identifier 


(C) NUMBER OF STRANDS: double 




OA 

30 


Unknown or Misplaced Identifier 


(D) CONFIGURATION: linear 




32 


Unknown or Misplaced Identifier 


(ii) TYPE OF MOLECULE: DNA(genomic) 




35 


Unknown or Misplaced Identifier 


(ix) CHARACTERISTIC: 




36 


Unknown or Misplaced Identifier 


(A) NAME/KEY: CDS 




J / 


Unknown or Misplaced Identifier 


(B) POSITIONS.. 609 




/in 
40 


Wrong Or Missing Stranded ness Value 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 


1: 


Af\ 

40 


Wrong or Missing Sequence Topology 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: I: 


1 1 A 

1 14 


Unknown or Misplaced Identifier 


(D) CONFIGURATION: linear 




1 1 / 


Wrong or Missing Sequence Topology 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 


2: 


1 

1 _>o 


oiop v--ouoii cti cnu oi sequence reinovcu - no error 


(2) INFORMATION FOR SEQ ID NO: 3: 




161 


Entered (582) and Calc. Seq. Length (0) differ 


(A) LENGTH: 582 base pairs 




163 


Unknown or Misplaced Identifier 


(C) NUMBER OF STRANDS: double 




164 


Unknown or Misplaced Identifier 


(D) CONFIGURATION: linear 




169 


Unknown or Misplaced Identifier 


(ix) CHARACTERISTIC: 




170 


Unknown or Misplaced Identifier 


(A) NAME/ KEY: CDS 




171 


Unknown or Misplaced Identifier 


(B) POSITION: 1..582 




174 


Wrong Or Missing Strandedness Value 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 


3: 


174 


Wrong or Missing Sequence Topology 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 


3: 


247 


Unknown or Misplaced Identifier 


(D) CONFIGURATION: linear 




249 


Unknown or Misplaced Identifier 


(ii) TYPE OF MOLECULE: protein 




250 


Wrong or Missing Sequence Topology 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 


295 


Stop Codon at end of sequence removed - no error 







PAGE: 1 SEQUENCE MISSING ITEM REPORT DATE: 07/06/2001 

PATENT APPLICATION US/09/856,796 TIME: 04:30:42 

INPUT SET: S36582.raw 



APPLICANT 

ADDRESSEE 

STREET 

CITY 

STATE 

COUNTRY 

ZIP 

CORRESPONDENCE ADDRESS 
MEDIUM TYPE 
OPERATING SYSTEM 
APPLICATION NUMBER 
FILING DATE 
CLASSIFICATION 
CURRENT APPLICATION DATA 
APPLICATION NUMBER 
FILING DATE 

PRIOR APPLICATION DATA 

STRANDEDNESS 

TOPOLOGY 



PAGE: 6 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/856,796 



DATE: 07/06/2001 
TIME: 04:30:42 



-> 
-> 



249 
250 

251 
252 
253 
254 
255 
256 
257 
258 
259 
260 
261 
262 
263 
264 
265 
266 
267 
268 
269 
270 
271 
272 
273 
274 
275 
276 
277 
278 
279 
280 
281 
282 
283 
284 
285 
286 
287 
288 
289 
290 
291 
292 
293 
294 
295 
296 
297 
298 
299 



INPUT SET: S36582.ra w 



(ii) < $XEE_OFMOLECUMU' protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4; 



Met Gly Val His Glu Cys Pro Ala Trp Leu Trp Leu Leu Leu Ser Leu 
15 10 15 

Leu Ser Leu Pro Leu Gly Leu Pro Val Leu Gly Ala Pro Pro Arg Leu 
20 25 30 

lie Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu Leu Glu Ala Lys Glu 
35 40 45 

Ala Glu Asn lie Thr Thr Gly Cys Ala Glu His Cys Ser Leu Asn Glu 
50 55 60 

Asn lie Thr Val Pro Asp Thr Lys Val Asn Phe Tyr Ala Trp Lys Arg 
65 70 75 80 

Met Glu Val Gly Gin Gin Ala Val Glu Val Trp Gin Gly Leu Ala Leu 
85 90 95 

Leu Ser Glu Ala Val Leu Arg Gly Gin Ala Leu Leu Val Asn Ser Ser 
100 105 110 

Gin Pro Trp Glu Pro Leu Gin Leu His Val Asp Lys Ala Val Ser Gly 
115 120 125 

Leu Arg Ser Leu Thr Thr Leu Leu Arg Ala Leu Gly Ala Gin Lys Glu 
130 135 140 

Ala lie Ser Pro Pro Asp Ala Ala Ser Ala Ala Pro Leu Arg Thr lie 
145 150 155 160 

Thr Ala Asp Thr Phe Arg Lys Leu Phe Arg Val Tyr Ser Asn Phe Leu 
165 170 175 

Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala Cys Arg Thr Gly Asp 
180 185 190 

Arg 



# 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION 
ATTN: WEW RULE S CASES 



SERIAL NUMBER: 




1 L^^ Wrappcd Nucleics 
Wrapped Aminos 



PLEASE DISREGARD ENGLISH "ALPHA** HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

he numbcrAext at the end of each line **wrappcd" down to the next line. This may occur if your file 
was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping." 



Jnvalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 



3 Misaligned Amino 

T Numbering 



10 



11 



12 



Non-ASCII 



_ Variable Length 



Palcntln 2.0 
" "bug" 



7 Skipped Sequences 

(OLD RULES) 



The numbering under each 5* amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII text 

Sequcnce(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequences^) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

Sequences) missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ED NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 



8 Skipped Sequences Sequencers) _ 



missing. If intentional, please insert the following lines for each skipped sequence. 



(NEW RULES) 



_Use of n's or Xaa's 
(NEW RULES) 

_Invalid<213> 
Response 



<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence 



Use of <220> 



Sequencc(s) 



missing the <220> "Feature" and associated numeric identifiers and responses. 



Patentln 2.0 
""bug- 



Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 
"Unknown." Please explain source of genetic material in <220> to <223> section. 
(See 'Tederal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



AMC - Biotechnology Systems Branch - 06/04/2001 



